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Abstract 

We present the Progressive Second Price auction, a new decen- 
tralized mechanism for allocation of variable-size shares of a resource 
among multiple users. Unlike most mechanisms in the economics li- 
terature, PSP is designed with a very small message space, making 
it suitable for real-time market pricing of communication bandwidth. 
Under elastic demand, the PSP auction is incentive compatible and 
stable, in that it has a "truthful' 5 e-Nash equilibrium where all players 
bid at prices equal to their marginal valuation of the resource. PSP 
is economically efficient in that the equilibrium allocation maximizes 
total user value. With simulations using a protype implementation 
of the auction game on the Internet, we investigate how convergence 
times scale with the number of bidders, as well as the trade-off between 
engineering and economic efficiency. We also provide a rate-distortion 

* Parts of this work were presented at the S th International Symposium on Dynamic 
Games and Applications, Maastricht, The Netherlands, July 1998, and at the DIM ACS 
Workshop on Economics. Game Theory, and the internet, Rutgers, NJ, April 1997. 

* Corresponding author. 
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theoretic basis for valuation of bandwidth, which leads naturally to the 
elastic demand model that is assumed in the analysis of the mechanism. 

Keywords: resource allocation, auctions, game theory, mechanism 
design, network pricing. 

1 Introduction 

Communication networks are characterized by what economists call exter- 
nalities. The value a user gets from the network depends on the other 
users. The positive externalities are that a communication network is more 
valuable if more people are connected. The negative externalities are that 
resources are shared by users who - because of distance, population size, or 
selfishness - cannot or will not coordinate their actions sufficiently to achieve 
the most desirable allocation of resources- The recognition of this reality in 
many aspects of networks and distributed computations has lead in recent 
years to the emergence of game theoretic approaches in their analysis and 
design [23, 9, 25, 33, 15, 16]. 

Prices, whether they relate to "real money w in a public network or "funny 
money" (based on quotas) in a private system, play a key role as allocation 
control signals. In the former case, this role is of course intimately tied to 
another, which is to allow a network provider to remain in business [7]. 

The telephone system and the current Internet represent two extremes 
of the relationship between resource allocation and pricing. The resources 
allocated to a telephone call are fixed, and usage prices are based on the 
predictability of the total demand at any given time. On the Internet, the 
current practice of pricing by the maximum capacity of the user's connection 
(flat-rate pricing) decouples the allocation (actual use) of resources from the 
prices. 

In the emerging multiservice networks (ATM, Next-Generation Inter- 
net), neither of these approaches are viable. The former because of the wide 
and rapidly evolving range of applications (including some which adapt to 
resource availability) will make demand more difficult to predict. And the 
latter because, once the flat fee is paid, there are no incentives to limit usage 
since increasing consumption benefits the user individually, whereas limiting 
it to sustainable levels brings benefits which are shared by all. This makes 
it vulnerable to the well-known "tragedy of the commons". With flat pric- 
ing alone, the tendency is toward increasing congestion which chases away 
high-value users, or increasing prices which exclude low-value users [6], in 
both cases leading to decreased network revenue. 
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Thus there is a need to develop new approaches to pricing of network 
resources* Among the requirements are: sensitivity to the range of resource 
requirements (either through a sufficiently rich range of traffic classes which 
are priced differently, or by allowing users to explicitly quantify resource 
requirements); prices must be dynamically responsive to unpredictable de- 
mand (market based system); perhaps most importantly the pricing archi- 
tecture should constrain as little as possible the efficiency trade-offs of the 
policies. 

Indeed, the fundamental issue in designing pricing policies is the trade- 
off between engineering efficiency and economic efficiency. This trade-off, 
which is more or less constrained by the underlying network technology, has 
many dimensions, including: 

• how much measurement (from usage to capacity pricing), 

• the granularity of differently priced service offerings (e.g. number of 
traffic classes), 

• the level of resource aggregation - both in time and in space - at which 
pricing is done (per packet /cell or per connection, at the edge of the 
network or at each hop), and 

• the information requirement (how much a priori knowledge of user 
behavior and preferences is required/assumed by the network in com- 
puting prices). 

An approach which achieves economic efficiency is the smart-market ap- 
proach of [19], wherein each packet contains a bid, and if it is served, pays 
a clearing price given by the highest bid among packets which are denied 
service (dropped). This approach is incentive compatible in that the optimal 
strategy for a (selfish) user is to set the bid price in each packet equal to 
the true valuation. Each node in the network becomes an efficient market, 
but the engineering cost (sorting packets by bid price, as well as per-packet 
and per-hop accounting) could be significant if line speeds are high relative 
to the processing power in the router. In [14], users are charged according 
to a combination of declared and measured characteristics of traffic. By 
taking an equivalent bandwidth model of resource utilization, and assuming 
appropriate traffic models, a menu of pricing plans indexed by the declared 
traffic can be offered which encourages users to make truthful declarations 
(e.g. of the mean rate), and also encourages the users' characterization ef- 
forts to be directed where they are most relevant to the network resource 
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allocation. As the pricing is relative, [14] does not aim to address the prob- 
lem of determining the actual monetary values of the market price (that 
users would be willing to pay). Another pricing scheme which incorpo- 
rates multiplexing gain is formulated in [12]. These and a number of other 
schemes are summarized in [11], in a comprehensive view of the connection 
establishment process, which identifies the user-network negotiation as the 
key "missing link" in network engineering/economic research. In terms of 
our taxonomy of the previous paragraph, this is part of the information re- 
quirement trade-off. Indeed, in the absence of formal mechanisms to deal 
with the information problem, complex and (at least intuitively) undesirable 
things happen- For example, some providers offer expensive 'front of the 
book" rates to uninformed customers, and lower "back of the book" rates to 
informed customers who may be about to defect to another carrier (see [7] 
and also the recent wars between AT&T and MCI in consumer long-distance 
service in the United States). In [34], it is argued that architectural consid- 
erations such as where charges are assessed should take precedence over the 
pursuit of optimal efficiency,* and edge pricing (spatial aggregation in terms 
of our taxonomy) is proposed as a useful paradigm. 

In this paper, we propose a new auction mechanism which accommodates 
various dimensions of the engineering-economics trade-off. The mechanism 
applies to a generic arbitrarily divisible and additive resource model (which 
may be equivalent bandwidth, peak rate, contract regions, etc., at any level 
of aggregation.) It does not assume any specific mapping of resource allo- 
cation to quality of service. Rather, users are defined as having an explicit 
monetary valuation of quantities of resource, which the network doesn't or 
can't know a priori. Thus, in terms of our trade-off taxonomy, this mech- 
anism aims for unlimited granularity, flexibility in the level of aggregation 
and minimal information requirement. 

In the most likely auction scenaria, users would be aggregates of many 
flows data flows for which bulk capacity is being purchased for e.g. Virtual 
Paths, Virtual Private Networks, or edge capacity [2, 31]. 

We begin in Section 2 by formally presenting the design of our Progres- 
sive Second Price auction mechanism for sharing a single arbitrarily divisible 
resource, and relating it to classical mechanism design from the economics 
litterature. In Section 3, after describing our model of user preferences 
and the elastic demand assumption, we prove that PSP has the desired 
properties of incentive compatibility, stability, and efficiency. The section 
concludes with simulation results on the convergence properties, and the 
efficiency trade-offs. Appendix A describes an information theoretic basis 
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for valuations of the type that are assumed in the analysis of Sections 3, as 
one possible justification. 

2 Design of an Auction for a Divisible Resource 
2.1 Message Process 

Following [36], it is useful to expose the design in terms of its two aspects: 
realization, where a message process that enables a certain allocation ob- 
jective is defined; and Nash implementation, where allocation rules are de- 
signed with incentives which drive the players to an equilibrium where the 
(designer's) desired allocation is achieved. 

In this section we define the message process. Here we make the fun- 
damental choice which will constrain the subsequent aspects of the design* 
Our first concern here is with engineering. For the sake of scalability in a 
network setting, we shall aim for a process where a) the exchanged messages 
are as small as possible, while still conveying enough information to allow 
resource allocation and pricing to be performed without any a-priori knowl- 
edge of demand (market research, etc.); and b) the amount of computation 
at the center is minimized. 

Given a quantity Q of a resource, and a set of players I = {1, . . .,/}, 
an auction is a mechanism consisting of: 1) players submitting bids, i.e. 
declaring their desired share of the total resource and a price they are willing 
to pay for it, and 2) the auctioneer allocating shares of the resource to the 
players based on their bids* 

Player Vs bid is Si = (qupi) € Si = [0, Q] X [0, oo), meaning he would like 
a quantity # at a unit price p t *. A bid profile is 5 = .... $/). Following 
standard game theoretic notation, let s^i s (si,...,Si-_i,«s t - + i,.. i.e. 
the bid profile of player i's opponents, obtained from s by deleting s,\ When 
we wish to emphasize a dependence on a particular player ? s bid s», we will 
write the profile 5 as (s^s^i). 

The allocation is done by an allocation rule A, 

A : S — > S 

* = A(*) = (a(a),c(s)), 

where 5= Yli&Si- 

The i-th row of A(s)> Ai(s) — (a,'(s),c t '(s)), is the allocation to player 
i: she gets a quantity a{(s) for which she is charged c t *(s). Note that p is a 
price per unit and c is a total cost. 
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An allocation rale -4 is feasible if Vs, 
and Vi € 1, 

a>i(s) < ft, 
d(s) < piqi. 

Remark a: The above formulation is a generalization of what is usually 
meant by an auction. The latter is the special case where a w (s) = Q for 
some winner w G 1 and a{(s) = 0, Vi ^ w, i.e. the sale of a single indivisible 
object to one buyer, for which the theory is well developed [22, 24]. In our 
approach, allocations are for arbitrary shares of the total available quantity 
of resource. Equivalently, one could slice the resource into many small units, 
each of which is auctioned as an indivisible object- But in a practical imple- 
mentation of auctions for sharing a resource, a process of bidding for each 
individual unit would result in a tremendous signaling overhead. More im- 
portantly, since the users would be bidding on a discrete grid of quantities, 
analytical predictions of outcomes could be misleading since they could be 
sensitive to the particular choice of grid 1 . 

Remark b: Most of the mechanism design literature in Economics 
makes use of the following "Revelation Principle": 

Given any feasible auction mechanism, there exists an equivalent 2 
feasible direct revelation mechanism which gives to the seller and 
all bidders the same expected utilities as the given mechanism. 
([24], Lemma 1) 

In this sharing context, a direct revelation mechanism would be one where 
each user message consists of the user's type, which is the valuation 3 of the 
resource over the whole range of their possible demands, i.e. a function 
9< • [OjQ] — [0r^)? a ad the budget (see Section 3.1). A consequence of 
revelation principle is that the mechanism designer can restrict her atten- 
tion to direct revelation mechanisms, find the best mechanism in terms of 

1 For a more detailed discussion of this point, see [4] p. 34, and references therein. 

2 By equivalent, in [24] it is meant that, at some equilibrium, all players get the same 
utility. There may be other, possibly ill-behaved, equilibria. 

3 The valuation of a given amount of resource is how much the user is willing to pay for 
that quantity. The inverse of the valuation is the user's demand function, giving a desired 
quantity for each price. 
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her (economic) efficiency objectives, and then - if necessary - transform it 
into an equivalent mechanism in the desired message space. This is conve- 
nient because one can exclude the infinitely many mechanisms with larger 
message spaces, without fear of missing any better designs. The design pro- 
cess is usually the solution of an optimization (mathematical programming) 
problem- For this reason, in the litterature, mechanism design problems are 
mostly solved for cases where the space of users 5 types is one dimensional, 
or at most finite dimensional [21], using message spaces that are of the same 
dimension. 

In our sharing problem, the conventional approach is unsatisfactory in 
two ways: 

♦ First, a user's type is infinite-dimensional, as we do not restrict the 
valuation functions beyond some very general assumptions (see Sec- 
tion 3.1), and so the conventional "programming* approach of de- 
riving the mechanism from the revelation principle would lead to an 
intractable problem. 

• Second, the conventional (direct revelation) approach, even if it was 
tractable, implies that a single message (bid) can theoretically be in- 
finitely long, because it has to contain a description of the function 0,-. 
Clearly, this is not desirable in a communication network, where sig- 
naling load is a key consideration. For engineering reasons, we choose 
a message space that is 2-dimensional. Therefore, a given message can 
come from many possible types, so there is no single way to do the 
transformation from the direct revelation mechanism to the desired 
one. 

Thus, unlike most of the mechanism design literature, we will take a di- 
rect approach, where we posit an allocation rule for our desired message 
space, and then show that it has an equilibrium, and that the design objec- 
tive is met at equilibrium 4 . This is equivalent to guessing the right direct- 
revelation-to-desired-mechanism transformation and building it into the al- 
location rule from the start. 

4 Our aim is to to show that we can use this smaller message space and still achieve our 
objective. 



A-007 



ai=Q-q5-q4 
ci — flfl 



Figure 1: Exclusion-compensation prindple: the intuition behind the PSP 
rule 



2.2 Allocation Rule 

Define, for y > 0 



(i) 



and 



Gi(lM-i) = lim QXij,s-i) = 



Q- E 

The "progressive second price" (PSP) allocation rule is defined as follows: 



m(s) = ?iAg.(R,5-t)» 



(2) 
(3) 



where A means taking the minimum. 

Remark a; For a fixed opponent profile ,s_ t . Qi(pi,S-i) represents the 
maximum available quantity at a bid price of /?,. The intuition behind 
PSP is an exclusion-compensation principle: player % pays for his allocation 
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so as to exactly cover the "social opportunity cost" which is given by the de- 
clared willingness to pay (bids) of the users who are excluded by i's presence 
(see Figure 1), and thus also compensates the seller for the maximum lost 
potential revenue. Note that this amounts to implicitly assuming that the 
bid price accurately reflects the marginal valuation 0(* on the range [0,-,$]. 
In other words, by this rule the auctioneer is saying to the player: "if you 
bid (qi,Pi)<, I take it to mean that in the vicinity of can be approxi- 
mated by a line of slope This is the (built-in) transformation from the 
direct-revelation mechanism to the desired message process discussed in the 
second remark at the end of Section 2.1. 

The charge c,- increases with a t * in a manner similar to the income tax in 
a progressive tax system. For a fixed opponent profile imagine player i 
is increasing starting from 0. The first few units that player i gets will be 
taken away from the lowest clearing opponent (i.e. m = argminj{pj : aj > 
0}), and player i will pay a price (marginal cost) p m per unit. When a m 
reaches 0, the subsequent units that player i gets will cost him p^ > p m , 
where m' is the new lowest clearing player, the one just above m. The 
PSP rule is the natural generalization of second-price auctions (or Vickrey 
auctions). In a Vickrey auction of a single non-divisible object, each player 
submits a sealed bid. and the object is sold to the highest bidder at the bid 
price of the second highest bidder, which is what happens here if g t - = Q, Vi. 
This is widely known to have many desirable properties [35, 24, 4], the most 
important of which is that it has an equilibrium profile where all players bid 
their true valuation. As we will presently show, this property is preserved 
by the PSP rule in the more general case of sharing an arbitrarily divisible 
resource, and this leads to stability (Nash equilibrium). The PSP rule is 
analogous to Clarke-Groves mechanisms [3, 8, 20] in the direct-revelation 
case. 

Remark b: When two players bid at exactly the same price, and they 
are asking for more than is available at that price, (2) punishes both of them. 
For example, if Q =r 100 and si = (4,60) and s 2 = (4,70), the allocations 
would be m = 60 A ( 100 - 70) = 30, and a 2 = 70 A (100 - 60) = 40. Since 
the bid prices are equal, there is no "right" way to decide who to give the 
remaining capacity to. One could divide it equally, or proportionally to their 
requests, etc. For the subsequent analysis, it turns out it is simpler to not 
give it to either one [of course, it will be allocated to the lower bidders if 
there are any). This is just a technicality since by deciding this, we ensure 
that it will never happen (at equilibrium), since the users will always prefer 
to change their prices and/or reduce their quantity. 
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Considering the computational complexity of PSP> a straightforward im- 
plementation would at worst, sort the bids in time /log/, perform (2) in 
linear time, and (3) can be done in time P. Thus, the complexity of com- 
puting the allocations is O(P). 

3 Analysis of the Progressive Second Price Auc- 
tion 

3.1 User Preferences 

Since the allocation rule A is given by design, the only analytical assump- 
tions we make is on the form of the players* preferences. 
Player i's preferences are given by his utility function 

m : S — ► (—00,00) 
s 1 — > 

Player i has a valuation of the resource &i(ai(s)) > 0, which is the total 
value to her of her allocation. Thus, for a bid profile of 3, under allocation 
rule A 7 player i getting an allocation A{(s) has the quasi-linear utility 

Ui(s) = 8i(ai(s))-Ci(s) (4) 

which is simply the value of what she gets minus the cost. 

In addition, the player can be constrained by a budget 6,- € [0, 00], so 
the bid $i must lie in the set 

Si(s-i) = {si € Si : < 6,}. (5) 

In the proofs of the following section, we will assume that users have 
elastic demand, that is: 

Assumption 1 For any 

• 0i(O) = 0. 

• #i is differ* ntiabk, 

• ^ > 0, nan* increasing and continuous 

• 3 7 ; > 0. Vr > 0, B\(z) > 0 => Vj? < z, z) < - 7i (z - 7?). 
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The last item says that as long as the valuation is strictly increasing, it must 
also be strictly concave (with minimum curvature 7,-). However, it is allowed 
to "flatten" beyond a certain amount of resource. 

Functions of this (concave) form have wide applicability as models of 
resource valuation, and can be justified from the economic standpoint (di- 
minishing returns) as well as from information theoretic standpoint - see 
Appendix A. For examples of valuations satisfying Assumption 1, see Sec- 
tion 3.4 and Appendix A. 

3.2 Equilibrium of PSP 

The auction game is given by (<J, uu - ■ that is, by specifying the 

resource, the players, and a feasible allocation rule. We analyze it as a 
strategic game of complete information [4]. 

Define the set of best replies to a profile s* t - of opponents bids: S^(s^i) = 
{si € Si(s-i) : ui(s i; s_i) > ui(s<; Vs{ 6 let = lit S7(s-i) 

A Nash equilibrium is a fixed point of the point-to-set mapping 5*, i.e. a 
profile 5 € S*(s). In other words, it is a point from which no player will 
want to unilaterally deviate. Such a point is what is most accepted as a 
consistent prediction of the actual outcome of a game, and has been re- 
peatedly confirmed by experiments, as well as a wide range of theoretical 
approaches. Indeed, in a dynamic game, where players recompute the best 
response to the current strategy profile of their opponents, this iteration 
can only converge to a Nash equilibrium (if it converges at all). In addition, 
an important trend in modern game theory is the development of learning 
models, and there too, it has been shown that Nash equilibria result also 
from rational learning through repeated play among the same players [13]. 

A more general (and hence weaker) notion of stability is the existence 
of an e-Nash equilibrium. Let the €-best replies be Sf(s-i) = {$i € 5i(s_ t ) : 
Ui($i; > Ui($' { : - €, Vs- € 5i(<s»;)}. An e-Nash equilibrium is a fixed 
point of 5 C . 

In a dynamic auction game, e > 0 can be interpreted as a bid fee paid 
by a bidder each time they submit a bid. Thus, the user will send a best 
reply bid as long as it improves her current utility by and the game can 
only end at an €-Xash equilibrium. 

Define 

Pi(z 9 s_i) = inf {y > 0 : Qi{y, > z) . (6) 
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Thus, for fixed Vjr, z > 0, 



z<Qi{y,s-i)=>y>Pi(z,s-i) 



(7) 



and 5 



y > Pi(z,s-i) =>z< 



(8) 



The graph of i^(.,$~ t -) is the "staircase" shown in Figure 1, and that of 
Q t -(.,5-t) is obtained by flipping it 90 degrees. 
It is readily apparent that 



The key property of PSP is that, for a given opponent profile, a player 
cannot do much better than simply tell the truth, which in this setting means 
bidding at a price equal to the marginal valuation, i.e. set pi = By 
doing so, she can always get within e > 0 of the best utility. 

Let Ti =s {si € Si : pi = the (unconstrained) set of player i's 

truthful bids, and T - fL* ^ • 

Proposition 1 (Incentive compatibility ) Under Assumption 1, Vi € X, 
Vs_» € S-i, such that Qi(Q,$-i) = 0, for any € > Q, there exists a truthful 
€-best reply U($-i) € % 0 S\{s^ E ). 
In particular, let 



Then with v { - [sup Gi(s-i) - €/^(0)] + and w { = B&Vi), U = (v iy Wi) 6 



The truthful best reply can be found in a straightforward manner, as 
illustrated in Figure 2. 

Proof: Fix € «SL». Let Zi = supG,(j?_ t ) and & = 

By definition of 2,-, 3{z(n)} C G,(s~») such that linin^n) = 2,-. Hence 6,- > 
hm„/^ n ^Pt(^- s~i)drj = f^* Pi{rj, s^i) drj > Ci(< t ;5-,), where the equality comes 
from the boundedness of P x - and the Lebesgue dominated convergence theorem; and 
the second inequality from (9) and (2). Thus U 6 TflS,^). 

5 Actually, since Q;(. 7 s-.,) is upper-semi-continuous (jumps up), we have z < 
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Figure 2: Truthful e-best reply 



Next we show that f» 6 5/ (s-,). First, z,- = limn < hnin Qi(6<(z(n)) t s_$) < 
Qi (Iim n 0i(z(n)), where the inequalities follow respectively from z(n) € G»(s_i) , 
and the upper semi-continuity of Now by the continuity of Q,-(lim n 0j(z(n)), s 

Qi(0i( 2 i)t s -i) = QiiViy^i), hence 

*<9<(W, *-0- (10) 

Now, we claim that = v,-. Indeed, if z,- = 0 then v,* = 0 and a{(U; = 0. 

If z{ > 0, then by (10), Qi(yi,s-.i) > 0 and since by hypothesis Q«(0 f £_,-) = 0, we 
have Oi{zi) = > 0. Also, z,* > 0 implies t/ a - < z,*. Therefore, by Assumption 1, 
we have w% = #,'( v *) > flJCzi) = Hence, since £?.(.,$-,*) is non-decreasing, 
Q.(wi 9 ?-f) > lim^xy ^.(77, J?-,) = Qi(yi,s„i) > z { > i^Thus, by (2), 

ai(ti;s-i) = v.- (11) 

Now Vs f - G 

= *i(ai{fc;*-i)) " *«(«*(*)) - s-») + Ci(*) 

= ^ a,t ^ tfl(*. *-0 - «{«] <fc + £' ir,i=. *-<) - «{(--)] & 
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(12) 



> / (?i(z t a.i)-«IW]-fe-f 



where the inequality follows from (z» — vi) < f/^(0) and the fact that tfj is non- 
increasing. Thus, it suffices to show that the integral is > 0. 

If Zi < a,*(s), take any z € (z,' f a>i{s)}> By the definition of z,-, z £ G»(s-i). Now 
Si G 5i( 5 -0 implies 6, > «(*) = fi M Pi(v, dv > Jo Therefore, 
we must have z > <3i(0;( z ))> Wflicn by (8), implies 0J(z) < #(z) and the integrand 
in (12) is > 0 as desired. 

Suppose zi > ai(si). Since B[ is non-increasing, Qi(.,s-t) is non-decreasing and 
> 0, any point to the left of z,- is in the set Gi(s-t), Vz < z t -, z G G,(s_i), 
hence z < Qj(0}(z),s.,-) which by (7), implies 0<(z) > i^(z, so the integrand 
in (12) is < 0 as desired. O 

Figure 3 shows the utility function of player 4, ^4(^4), in a PSP auction 
with 7 = 5 players, with $-4 fixed, and a valuation 9 4 (q) = 10$. The plateaus 

correspond to the points where q 4 > f?4(p4,s) = \o - 2{j V y>p 4 }<y(«)] ? 



and 04(5) can no longer be increased at that bid price - see (2). At bid prices 
P4 > Ps? the utility decreases when 04 > Q — 95, because after that point, 
each additional unit of resource is taken away from player 5, and thus costs 
p 5 , which is more than ff A its value to player L Thus, each additional unit 
starts bringing negative utility. This is what discourages users from bidding 
above their valuation. Proposition 1 is illustrated by the fact that for any 
given quantity #4. the utility 114 is maximized on the plane p 4 = 0* 4 s 10. 

Remark: When the players have linear valuations and no budget con- 
straint (b{ = 00), PSP becomes identical to a second-price auction for a 
non-divisible object. Then the existence of a Nash equilibrium follows di- 
rectly from incentive compatibility. 

Note that in PSP, the incentive compatibility (optimality of truth-telling) 
is in the price dimension, for a given quantity. With the message space we 
Have designed, there is no single "true" quantity to declare, the optimal 
quantity depends on opponent bid prices. Were the message process such 
that players declared a price and a budget (rather than desired quantity), 
it may have been possible to design an allocation rule A such that they are 
inclined to reveal their true budget, thus obtaining incentive compatibility in 
both dimensions, and hence equilibrium. But such a rule A would likely not 
have a simple closed form like (2) and (3). In essence, the computational load 
of translating budgets into shares would be centralized at the auctioneer, 
thus making the system less scalable to large numbers of users. On the 
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price — p4 0 0 quantity — q4 

Figure 3: Utility u 4 (s 4 ) for s x = (100,1), s 2 = (10,2), s 3 = (20,4), s 5 = 
(20.7), 5 6 = (30, 12) 

other hand, decentralization has a cost too, which is the signaling overhead 
resulting from players possibly adjusting bids based on opponent bids in the 
iterated game. Our design is based on the premise that the latter approach 
is the more scalable of the two (indeed that was the reason for choosing a 
small message space). 

The next property is that the truthful best reply is continuous in op- 
ponent profiles (this can be seen in Figure 2: as the "staircase" is varied 
smoothly, the point of intersection with 6\ moves smoothly, provided 8\ is 
not flat - which is given by the last time item in Assumption 1)* To prove 
that, we will need the following: 

Lemma 1 Vs,s' € S,Vy,z > 0, W > 0, «/I|s_ t - -sl t -|| < S then 

Qi(y + 8. 5., ) + SVI > Qi(y, O > Qi(y - S, - iVl, (13) 

and 

Pi(z + 6>/I. s-i) + S > Pi&slj) > Pi(z - 6VJ,s-i) - <5. (14) 

Proof: First, ||s_, -.v'_ -j| < S implies^ \q' k -qk\ < 6\/7, and p k +6 > pj. > pk-6. 

Thus, E*ff*l{p*+*>») +^ > Hk<lkhv' k >y} > Hk<lkl{p k -6>y) -«V7. Then, 
using (1) and the identity (a + b)+ < (a) + -f (£) + t the first result follows. 
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For any y < P,(z, $L,.), by (7), we have z > Qi(y, s 1 ^) > Qi(y - - *vT, 
which by (8), implies y — 6 < Pi(z + 6y/I,s-i)- Letting y / P^z,^,), we get 

For any y > Pi(z, s'.J, by (8), we have z < t?i(y, < Qi{y + <5,*_,) + *V7, 
which by (7) implies y + 6 > Pi(z - <$V7). Letting y \ .ft we get 

Piiz^^yPiiz-S^-S. a 

Lemma 2 (Continuity of best reply) Under Assumption 1, Vi € I, ifte €-6es* 
rep/y t» tfiven in Proposition J is continuous in s_ t * on any subset V{(P,P) = 
{s^i € : V* > 0,P > Pi(jB,3-i) > £}, with 00 > 7 > £> 0. 

Proof: Let z; = sup Gi(s~i). We will show z* is continuous, and the continuity of 
V{ = ^ = foll° w immediately (recall that by Assumption 1, 

6\ is continuous). 

Suppose there is a discontinuity at some 5_*. Then, 3t\ > 0, such that V£ > 0, 
€ K(P» P) with - < 6 and ||zi - z<\\ > e lt where zj = supG,-^). 

Suppose Z{ j -f- ci < z\ = sup{? t -(sl t ) (the case z t ' -hci < z,* is handled identically, 
with 5_i and sl € interchanged. ), Consider the definition of G t *; since B\ is decreas- 
ing and <2i(.,s-i) is non-decreasing and Pj(.,s_i) > 0, any point to the left of zj is 
in the set Gi(s'_ i ) i therefore 



using Lemma 1. 

Also, by (7) + ti < Qiityzi + ei) t *l<) => 0<(z,- + *i) > ^ + d, *!-*). Now 
since sl t - € K(P, P), this last expression is > P > 0, hence tf<(z» + e x ) > 0. Then 
using Assumption 1, 0<(z t - + c 2 ) > 0*(z,- + ei) -f 7.(^1 - €2) > 0*(*t + *i) + for 
£ < ^i = ^ A ci?,-, and 0 < e 2 < (*i - $iV7) A {t\ - £1/7*). Therefore, since 
Qi(.,s-i) is non-decreasing, 



Thus, zi + ei < Q,(^(^ + *i)>0 = [O - £* «il{pi>^(*.+ci))] + - Therefore, 
* + «i < + f 1} + 6, s^i) + W7, 



(15) 



zi + e 2 < QiW& + ^2), + *V7 - €! + € 2 



(16) 



Now (15) also implies that 
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and this holds V<r3 < ei- Now, using Lemma 1, 



/ [Pi(TI,s'_ i )-P i (r,,s. i )]dr, 
Jo 

> / P^-WI,^;)^- / ft(lj,«-i)<f? 

Jo Jo 

> / *f 

> -(q + p)$VI 

Let 6 2 = (g ^v7 > a & d €3 such that 0 < e 3 < e 2 - (Q + 7 5 )«aV?/£- Then 

6 f > r ,+ * *|, (17) 

Jo 

for £ < 62. 

Now choosing S < 6 X Atf 2 , (16) and (17) imply that Gi(s~0 9 fe-KsjAfc+ea) > 
z f * = sup Gi(s~i) 9 a contradiction. D 

We introduce one additional player, player 0, whose valuation is 0q(z) = 
poZj and whose bid can therefore be fixed at so = (<Zo?Po) = (Q?Po)* Player 
0 can be viewed as the auctioneer, and po > 0 as a "reserve price" at which 
the seller is willing to u buy* all of the resource from himself. From (1), the 
presence of the bid so = (Q,po) implies Vi 6 XiQi(y,s-i) = 0,Vy < po- In 
particular, setting y = 0, the condition of Proposition 1 holds. Thus, we can 
restrict our attention to truthful strategies only, and still have feasible best 
replies. This forms a -truthful" game embedded within the larger auction 
game, where the strategy space is T C «S, the feasible sets are % fl Si(s^i)^ 
and the best replies are Rj{s) = 7^0 S- (s). A fixed point of R c in T is a 
fixed point of S t in 5. Thus an equilibrium of the embedded game is an 
equilibrium of the whole game. 

Proposition 2 (Nash equilibrium) In the auction game mth the PSP rule 
given by (2) and (3) and a reserve price po > 0, and players described by (4) 
and (5), if Assumption 1 holds, then for any e > 0, there exists a truthful 
e-Nash equilibrium s m € T. 

Proof; Vs € T, Vi € I. Vz > 0, we have z > 0 = Q»W2, «-,■), which by (8) 
implies Pi(z,s-i) > po/2 = P. Let P = maxjfc €: ru{o} ^i(^)* Then, the conditions 
of Lemma 2 are satisfied and t = (v, u;) is continuous in s on T. By Assump- 
tion 1. B\ is continuous therefore v(q,p) = v($ 7 0'(g)) (as defined in Proposition 1), 
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can be viewed as a continuous mapping of [0, Q) J onto itself. By Brouwer's fixed- 
point theorem (see for example [10]), any continuous mapping of a convex compact 
set into itself has at least one fixed point, i.e. 3g* = v(q*) € [G,QY- Now with 
s* = (g*, we tave s * = € T. a 



3.3 Efficiency 

The objective in designing the auction is that, at equilbrium, resources al- 
ways go to those who value them most. Indeed, the PSP mechanism does 
have that property. This can be loosely argued as follows: for each player, 
the marginal valuation is never greater than the bid price of any opponent 
who is getting a non-zero allocation. Thus, whenever there is a player j 
whose marginal valuation is less than player Vs and j is getting a non- 
zero allocation, i can take some away from j 9 paying a price less than Vs 
marginal valuation, i.e. increasing but also increasing the total value, 
since i*s marginal value is greater. Thus at equilibrium, i.e. when no one 
caji unilaterally increase their utility, the total value is maximized. Formally, 
consider a € argmax^ £i rdt(ot)- The Karush-Kuhn-Tucker [17] optimality 
conditions are that there exists a Lagrange multiplier A such that fi;(a f -) — ^» 
if at > 0, and «S(0) < A, if a,- = 0. 

Assumption 2 For any i € X, b{ = oo, and 9$ satisfied 

«S(Z) -Oft/) >-K(z-Z% 

whenever z > z 1 > 0. 

Given any e > 0, for any €-Nash-equilbrium s* € T, let a* = a(s*), and 
let a* = nun f€ j U {o} t o t >o a i ? the smallest non-zero allocation. The following 
is the "e version" of the Karush-Kuhn-Tucker conditions. 

Lemma 3 Suppose Assumptions 1 and 2 hold. If for some j, a* > ^/c/k, 
thenVi €lU{0}, 

«?(aJ)<fl}(aJ) + 2V^. 

An immediate corollary is that if a* > \JTfk then 

A* - 2yf€K < flfta?) < A* + 2y^ 

6 If $i is differenttabk, the condition is 0 > 0" > -k . 
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if a* > y/e/K and 

A* + 2^, 

ifdi = Of for some A* > 0, 

Proof: Suppose tfj(a?) > ^(ap + 2y/tK. Since 0j(aJ) > > pj, we have 

Since a!- > 0, if player f bids at a price above pj, he can take all of player j's 
allocation, without losing anything of his own, i.e. a? < Qi(pJ» 5 -i)- % (7), 
this implies 

>w +«;,*!*).. 

Let ? t - = (a? + V*/*) 311(1 *» = (?••> ^ (««))• Then 



> [flKa? + v^)-?jJV^ 



> 



which contradicts the fact that s* is an e-Nash equilibrium. O 



Proposition 3 (Efficiency) Suppose Assumptions 1 and 2 hold. If a* > 
yfefn, then 

max £ ft(o,0 - £ ft(a?) = 0( y/St), 

i" i 

where A = {a G [0 ? Q] /+1 : £,a,- < Q}. 

Proof: (of Proposition 3) Let I + = {A : a* > aj} and Z~ - {k : a k < a%}. For 
i € I + , we have 0-( a i ) < A" + 2v/eK. For i € T", we have a- > a t * > 0, therefore 
by the lemma, 0f( fl ?) > A* - 2>/e/c. Therefore, 

< (A* + 2Ve^)A - (A* - 2V^)A T 

where A = — a?) = Hj-( a i *" a *)' Since A < Q the result follows, with 

the bound AQy/ai. O 



A-019 



iniMni iiriin Pipir 



Remark a: The condition 6, = oo is sufficient, but not necessary to 
achieve efficient outcomes. In fact with any budget profile, efficiency can 
be achieved if the users cooperate. For example, if they all choose a bid 
quantity close to what they can actually obtain (which they do if they use 
the strategy given by Proposition 1), then the price paid would be po per 
unit for all the allocations, and if po or the shares a* are not too large, 
then budget constraints are irrelevant and a* is efficient. More generally, 
efficiency is attained if the budgets are not too far out of line with the 
valuations, i.e. there are no players with very high demand and very low 
budget. 

Remark b: ( Welfare and Efficiency) A more common measure of effi- 
ciency is the social welfare, which is the sum of all the players' utility u,-, 
including the seller i = 0. The natural definition of the seller's utility is the 
value of the leftover capacity a 0 = Q — Yli^o °* pl us *he revenue, i.e. 

no = 0o(a Q ) + ^2 Ci. 

Then, u,- = E^o(^i - + «o = E*o «• + *o(«o) = Ei ft- Thus, ti; 
is equivalent to the efficiency measure used above, which is Jwft. Another 
measure is the seller's revenue. Even though PSP is not, in general, revenue- 
maximizing, it tends to the revenue maximizing allocations and prices as 
demand increases [18]. 

Remark c: Proposition 3 provides a key to understanding the basic 
trade-off between engineering , and economic efficiency. The smaller e, the 
closer we get to the value-optimal allocations. But in a dynamic game, 
where players iteratively adjust their bids to the opponent profile, a player 
will bid as long as he can gain at least e utility (since that is the cost of 
the bid), thus a smaller e makes the iteration take longer to converge, i.e. 
entails more signaling. 

3.4 Convergence 

An issue of obvious concern is whether the game converges under dynamic 
play: it turns out that it does, when users behave rationally (see Proposition 
4 in Chapter 2 of [29]). Moreover, irrational or malicious behavior - like 
intentionally trying to prevent convergence by making unnecessary bids - 
can always be controlled by setting the bid fee e high enough to make such 
behavior prohibitively costly for the culprit. 
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Figure 4: Parabolic valuation with k = 0*5 and q = 70 

Another issue is how the convergence time scales with the number of 
bidders. We now consider this experimentally using the software described 
in Appendix B. 

In all our simulations we let Q = 100. For each user, the valuation is 
strictly increasing and concave up to a maximum corresponding to a physical 
line capacity, and flat beyond that. Since, as shown by Proposition 3, only 
the second derivative of the valuation is needed to measure the efficiency 
of the PSP auction, a second order (parabolic) model is deemed sufficient. 
Thus we use valuations of the form: 

9i(z) = -Ki(z A q { ) 2 /2 + Kiqi{z A gj, 

where q { is the line rate, and Ki > 0 (see Figure 4). 

We generate our user population with independent random variables 
{0*(O)}j (corresponding to the maximum unit price the user would pay) 
uniformly distributed on [10,20], and K{ = 0f(O)/g 1? and q { uniformly dis- 
tributed on [50, 100]. All players have a budget = 100. The bid fee is 
fixed at e = 5. Each user has a bidding agent which can submit at most one 
bid per second (see Algorithm 1 in Appendix B). 

With this set-up, the results are shown in Figures 5-6. Simulations were 
run for 11 population sizes ranging from 2 to 96 players. Each point is 
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Figure 5: Mean (+/- std. dev.) number of bids - solid line. The dashed 
line is I + I 2 /1Q. 

simulated 10 times with new random valuations for all players. The overall 
mean is 11.9 bids per player. From Figure 5, the number of bids seems to 
grow as the square of the number of players. 

The actual time to converge, shown in Figure 6, grows more slowly, since 
the computation of bids is done in parallel by all the players. In fact, for 
small numbers of players, the time decreases. This can best be explained as 
follows. Suppose there are only two players, with similar valuations. They 
will both start by asking for their maximum quantity, at their marginal 
valuation (which at their maximum quantity is near zero). Then as each 
sees the other's bid, each will reduce the quantity and increase the price a 
little bit. And they go on taking turns, gradually raising the market price 
until they reach an equilibrium. However if there are 10 players, in between 
two bids by the same player, the 9 others will already have bid up the 
price, so he will jump to higher price than if there was only one opponent. 
Thus the equilibrium market price will be reached more quickly. For large 
populations, this effect becomes small compared to the sheer volume of bids, 
and the convergence time starts to grow. 

The trade-off between signaling and economic efficiency discussed in light 
of Proposition 3 is illustrated by Figures 7-8. Increasing the bid fee speeds 
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up convergence, at a cost of lost efficiency. A resource manager should select 
a bid fee which optimally balances the two for the particular context. 

Figure 8 also illustrates the validity of the lower bound given by Propo- 
sition 3. 

4 Conclusion 

Auctions are one of oldest surviving classes of economic insti- 
tutions [,..] As impressive as the historical longevity is the re- 
markable range of situations in which they are currently used. 
[22] 

We proposed the progressive second price auction, a new auction which 
generalizes key properties of traditional single non-divisible object auctions 
to the case where an arbitrarily divisible resource is to be shared- We have 
shown that our auction rule, assuming an elastic-demand model of user pref- 
erences, constitutes a stable and efficient allocation and pricing mechanism 
in a network context. Even though we are motivated by problems of band- 
width and buffer space reservation in a communication network, the auction 
*,vas formulated in a manner which is generic enough for use in a wide range 
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of situations. In the sequel to this work, we show that the key results - 
namely incentive compatibility, equilibrium, and efficiency * generalize to a 
setting where multiple networked resources are auctioned, with users bid- 
ding on arbitrary but fixed routes and topologies [29, 31]. In related work, 
we consider the case of stochastically arriving players bidding for advance 
reservations (i.e. resources for a given period of time) [29, 30]. 

An interesting direction of future work is learning strategies, and evolu- 
tionary behavior which can emerge from repeated inter-action between the 
same players. 
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A Information-theoretic basis for the valuation 



In general, valuations are simply assumed to be given as external factors. 
Indeed, the fundamental assumption in any market theory is that buyers 
know what the goods are worth to them. The "elastic demand 77 or "di- 
minishing returns" nature of Assumption 1 is fully justified from a purely 
economic standpoint for virtually all resources in everyday life. 

In the case of variable bandwidth, we can go even further by better quan- 
tifying what the goods are. For a user sending video, say, how much value is 
lost when the channel capacity goes from L5 to 1.2 Mbps? Ultimately, the 
value lies not in the amount of raw bandwidth but in the information that is 
successfully sent. Our goal in this section is to give a brief description of how 
Information Theory can be used for a bottom-up construction of bandwidth 
valuations - based on the fundamental thing the user cares about which is 
communication of information - and that such valuations will generally be 
of the type in Assumption 1. 

Any information source has a function !?(.), such that when compressed 
to a rate ii, the signal has a distortion of at least D{R) [1]. The distor- 
tion is the least possible expected "distance" between the original and com- 
pressed signals, where the minimization is over all possible coding/decoding 
schemes. In this context, we make the distance measure the monetary cost 
of the error. This cost can be chosen, for example, to be proportional to 
some common measures like the mean squared error, the Hamming distance 
(probability of error), the maximum error, etc., or heuristic measures based 
on experiments with human perception. Given that modern source-coding 
techniques can, given a distortion measure, achieve distortions close to the 
theoretical lower bound[5], it is not unreasonable to use the rate-distortion 
curve as an indication of the value of the bandwidth share. 

Let Dj(») be the distortion-rate function of a stochastic process 

modeling the source of information associated with user L Xi is encoded as 
Yi which has a rate of R bits per second. 

Shannon's channel coding theorem[32] states that Y{ can be received 
without errors if and only if the channel has a capacity C > R- In our 
auction context, user i has capacity (bandwidth allocation) C = a*, and 
thus has to suifer a distortion of at least A'(oi)- The value of the bandwidth 
is then 

OiiaO^Wi-Diiai), (18) 
where 9\ is the value of the full information. 
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The relevant properties of the distortion-rate functions are: 



• when the rate is greater than the entropy of the source, the distortion 
is zero, and 

• for many common source models and cost functions, the distortion- 
rate function is convex, and has a continuous derivative. 

It is easy to see that, with these properties, (18) satisfies Assumption 1. 

Example 1: Let {X} be a Bernoulli source, taking two values with 
probabilities p and 1 - p. Without loss of generality, let p € [0, 1/2]. In this 
case, since the source is ii.d, one can define the distortion on a per symbol 
basis. Using a Hamming cost function 

d(X,Y) - l{x^Y}t 

i.e. assuming it costs one unit of money every time one bit is wrong, we have 
the distortion D = Ed(X,Y) = P(X ^ }*). the rate-distortion function is 

where H (x) = -x log(x) - f 1 - x) log(l - x ). and the distortion-rate function 
is the inverse function. It can be easily seen that D(R) is strictly convex and 
decreasing for 0 < R < H{p), and D(R) = 0 for R > H(p). The continuity 
of D f on 0 < R < H(p) and R > S(p) is obvious- At the critical point 
(R = Hip), D = 0), we have 

lim D\R) = lim l/R f (D) 

= Kml/log(2>/l^D) 
= 0 

= Urn D'(R). 

Thus continuity of D f holds throughout, and Assumption 1 is valid for the 
valuation of the form (18) for this source - see Figure 9. 

Example 2: Let {X} be a Gaussian source with Markovian time- 
dependency, i.e a covariance matrix $ = (a 2 ^"^ . , , r € [0. 1 ). Suppose we 
use the squared error cost. i.e. it costs one unit of money for one unit of en- 
ergy in the error signal. Then, we have for low distortions D < ( 1 -r)/(l+r), 
tf(Z?)=|log^ ? or 



D(R) = (l-r 2 .<7 2 2" 2 * 
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Figure 9: Distortion-rate based valuation for a Bernoulli p = 1/2 source 

and Assumption 1 clearly holds for (18). In the i.Ld. case (r = 0), the 
formula holds for all R. 

As the source models get more complex, it rapidly becomes impossible 
to give closed-form expressions for either R(D) or D(R)* Often parametric 
forms are available, and the functions can be evaluated numerically. Fortu- 
nately, rhe convexity property extends to a wide class of models, including 
for example auto-regressive sources, even when the generating sequence is 
non-Gaussian (see [1] for a full treatment of ii(I?), including the above 
cases). 

It can happen, e.g. for some video source models, that the R-D curve, 
which gives the best (R,D) pairs achievable by any coder/decoder, is not 
convex. But. for tractability, practical codecs are usually optimized on a 
convex hull of the space of possible (R,D) pairs[26, 27]. Thus, even when 
the theoretical D(R) curve is not convex, the actual distortion achieved in 
real-life systems almost always varies in a convex manner with the available 
bandwidih* 
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B Simulation software and bidding algorithm 

A prototype software agent based implementation of the auction game, 
called TREX, has been developed and extensively used since December 1995. 
Much of the intuition behind the mechanism design and the analysis in this 
work came from experiments done on this inter-active distributed auction 
game on the World Wide Web, using the Java programming language. The 
game can be played in real-time by any number of players from anywhere 
on the Internet [28]. 

Each user plays in the dynamic auction game using the following: 

Algorithm 1 1 Let s< = 0, and i.,* = 0. Start an independent 
thread which receives updates of ,§_,*. 

2 Compute the truthful t-best-reply of Proposition i, t t - 6 % O 

3 If Ui(ti; s~i) > U{(si; 5_ t ) + e, then send the bid $i = M* 

4 Sleep for 1 second, 

5 Go to 2. 

No assumption is made on the order of the turns. Players join the game 
at different times, and depending on the execution context of the client 
program, the sleep time of 1 second is more or less approximate. This, along 
with communication delays which make the times at which bids arrive at 
the server and updates at the clients essentially random times, make the 
distributed game completely asynchronous. 

Algorithm 1 can be described as selfish and short-sighted. Selfish because 
it will submit a new bid if and only if it can improve it's own utility (by more 
than the fee for the bid). Thus, the game can only converge 7 to an €-Nash 
equilibrium. And short-sighted because it does not take the extensive form 
of the game into account, i.e. does not use strategies which may result in a 
temporary loss but a better utility in the long run. 
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Chapter 4 

Spot and Derivative Markets in Admission 
Control 

Xet game now risk, G. [87] 

In Chapters 2 and 3, we proposed a pure market approach for bandwidth pricing, 
where allocated capacities may vary during the lifetime of a flow, as players compete 
for resources through an auction game. 

In this chapter, we propose a mechanism for circuit switched calls, wherein calls 
are admitted or rejected at (or soon after) their arrival time, and if admitted, get 
a fixed allocation of capacity, and have the option of securing the resource at a 
guaranteed maximum price for a guaranteed minimum duration. Thus the charge 
has two components: 

• a market-based usage charge, where the user continuously pays the "instan- 
taneous" market price (determined by second price auctions among recent 
arrivals); if the market price exceeds a user*5 bid price, that user is dropped 
unless 

• upon arrival, the user pays a reservation fee i buys an option contract), which 
gives him t he* right to buy the capacity ai any time in the future up to a 

if i 
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specified duration at his bid price. The user pays the market price as long as 
it is below his bic price. If at some point during the call, the market price 
exceeds the bid price, then the user automatically exercises the option, i.e., 
remains connected while paying no more than the bid price. 

In other words, we introduce a "derivative" instrument to reduce the uncertainty 
inherent in the "spot" market mechanism. Naturally, this contract (or reservation) 
must itself be sold for a "fair" price. In the context of financial markets, the fair price 
is calculated with the Black-Scholes approach [8, 65], which is based on the idea that 
the option must be priced such that a perfectly hedged (i.e., riskless) combination 
of the derivative and the underlying equity will provide (locally in time) the same 
expected return as a risk free security [36] (otherwise the derivative would present 
an arbitrage opportunky. i.e., unfair advantage, which would be exploited until its 
price rises). 

In our context, the contract differs in that, rather than the right to buy once at 
a given strike price, it gives the right to buy repeatedly over a given duration, i.e., 
it is a series of "call op:ions". Here the "fairness" we seek is that the reservation 
be priced in a way tha: reflects the probability that the system will become busier 
during the lifetime of the reservation, in order to avoid individually rational but 
socially sub-optimal behaviour. Specifically, we seek to avoid customers connecting 
at low periods and making a reservation limiting their usage price to an excessively 
low price for a very long time, to avoid rejoining when the prices are higher 1 . 

A further difference from usual options is that, rather than the standard geo- 
metric Brownian motion model of e.g., a stock price, we have an underlying spot 
market process that is derived directly from our queueing system, via a heavy-traffic 

1 A simplified form of this arbitrage is commonly observed with flat-rate priced dial-up Internet 
access. wUtr*- some users re~ain logged on for very long periods of time even iF l.hey are not using 
the network, in order to av:;d the chance of getting busy signals when they mvd to be on. 
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Figure 4-1: Queueing Model 
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approximation which leads to a diffusion process. 

From the point of view of the user, this pricing mechanism is simply an initial 
reservation fee plus a per-minute (for example) usage charge. The novelty in the 
networking context is that both axe market prices. 

In Section 4.1., we derive a heavy-traffic diffusion approximation model for the 
queueing system. Using that, in Section 4.2., we derive the corresponding model for 
the spot market price. Then in Section 4.3., we introduce our formulation of the 
reservation as a derivative financial instrument. Finally, in Section 4.4. we present 
some simulation results using a real trace of traffic at a dial-up Internet access 
modem pool. For the sake of readability, detailed calculations are relegated to 
Sections 4.5. and 4.6.. Although not strictly necessary, a layman's grasp of financial 
markets is helpful in reading this chapter. 



4.1. Queueing Model 
4.1.1 Preliminaries 

Customers arrive in a Poisson stream of rate A, with iJ.d. bid prices distributed 
according to a distribution F (density /). Call durations are exponentially dis- 
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STAGE I 



STAGE II 




caU competed 



Figure 4-2: Pricing Mechanism 



tributed with mean 1/r. The queueing system is shown in Figure 4-1. It consists of 
two stages, with buffers of size B and C respectively. 

The first stage consists of a second price auction, where the winners enter the 
second stage, and the losers leave the system. Specifically, the first stage has an 
exponentially distributed service time r wi::: mean l//i. At a service completion 
instant, let m be the number of customers in the first stage and n the number in 
the second stage, at that time. The m first s* age customers are ranked according to 
their bid prices. The (C— n) highest bids are accepted into the second stage, and the 
remaining m — (C— n) are dropped. The bid z rice of the highest dropped bid defines 
the new spot market price, which is valid uzrJ the next batch. If no customers are 
dropped, i.e., m < C — n, then the spot market price is zero. Figure 4-2 shows the 
functioning of the mechanism. 

The second stago is a C server queue r~:h no waiting room, where each cus- 
tomer has an exponential service time (call ru ration) of mean 1/r. The arrivals 
axe according to n batch Poisson process. *.*--:» rate /i, and batch sizes distributed 
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according to {^mj^-o* Let p = A//x. For 0 < m < f?, 




P{m arrivals in r} 



(l + />)™ +1 ' 



and 



fa = 



OO 



X m fi 




In Section 4.1.2, the steady-state distribution of the occupancy of the second 
queue can be shown to be of product form. However, for the purpose of pricing 
the reservations, we need a (probabilistic) model of its transient behaviour in the 
future, given :he current price. This is done in Section 4.1 .3. 

4*1.2 Product-form solution 

Let the state of the second queue (admitted calls) be X = (J^i,...,Xa') ? where 

Xk = (Xk.i .Xk,M k )<> Xk,i is the bid price of the i-th player in the &-th batch, 

Mk is number of calls remaining from the &-th oldest batch, and A' is the number 
of batches present. Let N = J^k-i ^fc» the total number of active calls. The letters 
x,rrik, and r. will denote generic values of X y Mk and iV, respectively. 

X(t) is a rontinuous time Markov process. The transitions are of two types. On 
one hand we have departures of individual customers: 



.r — (^lj • • • ? ♦ • • t *ZT A' ) * X — (#i , . . . , Ci{Xk)) 



(4.1) 
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occuring with transition probability 

7(s,;r') = r 3 

where a(xk) = • • • > £*,t-i » £jm+i , . - • ? Xk,m k )- On the other hand, we have batch 
arrivals: 

X = (XIi-.^Xk) X* = {Xi 9 ...)XKiXK+l), (4.2) 

occuring with probability intensity 

7 (X,X / ) =^/?m K+1 II f( X K+u) 
t=l 

if 0 < tuk+i <C — n, and 
if 777^-4. x = C — n. 

Proposition 11 7%e equilibrium probability of being in state x = (xi y ...,xx) is 

»(*) = »(•) f rf-V n z-j- n /(^), (4.3) 

w/iere i/ze normalization constant is given by 

n ■ \K K 



{x:0<n<C} 



| + EE E E - £ — 45#Wr 



n=l A=l m 1= =l m 2 =l m,^^ m l m 2 • • • ™ A -1 (« - m *) 



" iiRiiiiiiwiwimpiiNpiiiiifti ' ii fn f||i i i|i ii i [| 
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Proof: To prove the theorem, we first analyze and then conjecture the transition rates 
7* of the reversed process, X(—t)* and then verify that the following balance conditions: 

:r( * b(x, x f ) = T(x f )j*(x\ x), (4.4) 

and Va;, 

5>(*,*0 = E7*(*,*'), (4-5) 

hold. Indeed, if (4.4) and (4.5) axe satisfied, then by Theorem 1.13 of [46], t is the 
equilibrium probability of the Markov process X(t). 

Conjecture: JT(— f) = x has the following properties: 

• new batches are formed by customers arriving according to a Poisson process of rate 

• arrivals into existing batch A* axe according to a Poisson process of rate A*(ar). 
Arriving customers have a bid price Li.d. with distribution F; 

• upon arrival into a batch, if the system is not full (n < C), the customer is placed 
with equal probability in any position within the batch (from 1 to + 1). If the 
system is full, the new customer departs immediately; 

• batch k has an exponentially distributed service time, with mean l/fik(x). 
The rates for the reverse process are the following. First, 

A 0 = ~ ft). 

Second, if n < C or k < A\ 




771 
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and 

fi k (x)p xfl<m k <B 
0 Hm k = B 

Third, when n = C, and k = A', i.e., for the last batch when the system is full. 

m=£ K i<*<™k mkl(m - m k )\ \ p J 

and 

X K (x) = rm K - ( 4 -6) 

Note that in this last case all the arrivals immediately depart. 

We begin by verifying (4.4). For the transitions of type (4.1). i.e., corresponding 
to departures in the forward process, consider a departure from the Ar-th batch. When 
mjt > 1. (4.3) yields 

TTlJk 1 

1 + p rrik 

Since m^-l<5-Lwe have /3 mjb _i = p mk ~ l /(l + p) m *,and therefore jifr(ar') = r m* - 
1)/(1 — p). On the right hand side the system is not full (n' = n - 1 < C - 1). hence 
Ajt(#') = p>k{2')p = t+jK 771 * - 1)- Thus we have 

^(^)r = x(xO^^/(^). 

Now, since arrivals in the reverse process take any position in their batch with equal proba- 
bility, Xk(x')jmk is precisely the arrival rate into position i in batch k* i.e., Xk(x f )/m^f(Xk,i) < 
7~(x\x). as desired. If in* = 1. (4.3) yields 



-[ xi , . . . , (Zk,i J, . - • , x K )r = tc(xi , . . . , x k -i , a?jb+i , . . . . x K ) —f- -f{x hA ) r 

1 -t p r 

= *( X! , . . . , , X k +l , X'A* ) A 0 /f ZJU ) , 



1^ 



1 1 'iflimimwiUBiiMWdifii"" '< 
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as desired. 

For the transitions of type (4.2), if 1 < ro#+i < C - 12k=i m *' 

H^)Pm K ^ II f( X K+hi) = ^O^K+l-i— — ) A»jc +1 /* 
tsl P \ P ' 

as desired. If mjr+i = C - Jl^i 771 

*(*)/* E ^F(^ ZK+UT*™*- Um ' mv V II 



= ir(x')/*jc+i(a?')» 



as desired. 

We now verify (4.5). The left-hand side is the total rate at which the forward process 
X(t) leaves state x, which is /x(l - A>) + nr. The right-hand side is the total rate of leaving 
state x in the reverse process. Consider first a state where the system is not full (n < C). 
The total rate is then (letting k range from 1 to A') 

A 0 + !>*(*) + //*(*) = Ai(l-A>) + (l + />) £ /*(*)+ £ 
= /i(l-/^) + £rmjfe = /i(l-/fo) + rn, 



rmjfc 
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as desired. Now when n = C, the rate is 

= /z(l-)9o) + (n-;^y + r7n^-/iK(^) + Mis:(^), 



where the second term is derived as in the n < C case above, and the third term comes 
from the definition (4.6). Then the right hand side is ^(1 - /3 0 ) + nr, as desired. □ 



4*1*3 Diffusion approximation 

We develop a tractable approximation for the queue occupancy process N u by scal- 
ing the system following the well-known heavy traffic approach [37, 9, 97, 98, 18], 
applied here to a system with batch arrivals. 

Let a = ^ 5 m = EM = £* =0 m/? m , and v 2 = E(M 2 ) = £* =0 m 2 /? m . Consider 
a sequence of scaled systems, indexed by / = 1, 2, . . with capacity = la+^Vla, 
where 7 is arbitrary, and batch arrival rate /i® = Ifi. Let 

ZP ft ^ (4.7) 

the translated and scaled version of the queue occupancy of the /-th system. The 
drift and diffusion coefficient respectively are 

««(*) = limE Z ^" Z ^ |ZP = z 



(4.8) 




160 



M[ipMWirimiMiiiifmf(r< 





102 



and 




(4.9) 



The basic idea is that as / — ► oo. 



, =^ Zt, (where => denotes weak convergence, 



i.e., the probability distributions converge), where Z t is the diffusion process which 
solves the stochastic differential equation 



where W t is a Brownian motion. 

Note that, for 7 < 00, the offered load cfl>fC® = (1 + -f/Vla)' 1 f 1, which 
makes it a "heavy traffic" approximation. 

The coefficients a(.) and cr 2 (.) are obtained by letting / -* 00 in (4.8) and (4.9) 
respectively. In the original approach of [37], Z® =4> Z t holds if a(.) and <r 2 (.) are 
continuous, which is clearly not the case here, since both have jumps at z = 7. 
However, based on a theorem of Borovkov [9], Whitt [98] proposes a conditioning 
heuristic to derive approximate steady-state blocking probabilities for the G/GI/s/0 
(i.e., 3 servers, no waiting room) queue from the diffusion approximation for the 
corresponding G/GI/ 00 system. Our approach here will be to apply the heuristic 
to the diffusion process itself 2 , conditioning on Z t < 7. 

2 [98] p. 694 states that, for exponential service times, the heuristic is also applicable to the 
diffusion process itself. 



dZt = a(Z t )dt + <T(Zt)dWt 



(4.10) 
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4.2- Diffusion Models of the Market Prices 
4.2.1 Spot Price 

Suppose that at time the queue occupancy is N t = n, and a batch of size rn arrives. 
The admission decision is made, and a new spot price results. In this section, we 
want to derive a model which will tell us in some sense the future evolution of this 
price. 

Consider the mechanism by which the price arises. Of the m new customers, 
those with the C — n highest bid prices will be admitted, and the spot price will 
be the (C — n + l)-th highest bid price. Recall that the bid prices are i.i.d., with 
distribution F, which we assume to be smooth. The probability that the price is x 
is the probability that in m draws, one will equal x, and of the remaining rn — 1, 
C — n will be greater than and (m — 1) — (C — n) will be less than or equal to a;, 
which is 

" ( C -n)!( TO - ! l-C + n)! ^ " " ^ 

The distribution is 



Jo 



-I 



F{£) tf «-C+n-I (1 _ u f~n d 

(C-n)!(m-l-C + n)! . 



where the last equality comes from substituting the previous expression, and making 
the change of variables u = F(ij). <lu = f(y)dy. 

Let us consider a "first-order" approximation, the expected spot price given the 



104 



occupancy n and the size of the batch arrival m: 



= r[l-F n , m (x)]dx 

Jo 

— If g(u,n,7n)dudx 
Jo JF(x) 

Jo 



where 



4.2.2 Diffusion model of the spot price 

For the purpose of pricing a reservation beginning at time i, we would like to char- 
acterize the future evolution of the market price, in terms of what is known at or 
just before t. namely the occupancy jV t . Thus, it is natural to consider the process 

B 

m=0 

Remark: We prefer the above expression to one that uses the information in 
the batch tha: arrives at time since computing the option prices based on the new 
arrivals wouic add to the delay in the admission decision. By not waiting for the 
new information, we can use the time in between batches for computation of option 
prices. Of course the spot price results fronfi the auction among the new bids, but 
that is a very simple computation. 

Now. we approximate the price by a sequence of functions of the centered and 
scaled process 

P w { = ) = 1>(z\/ai+al). 



0 
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Then, Ito ? s rule (see, e.g., [45]) along with (4.10) yields the stochastic differential 
equation for P: 



Since 



dz 2 dz 2 



dt + ^<r {l) dW t . (4.12) 



p(')( z ) =J2Prn f F- l (u)g®(u, V^lz + aL m) du, 

m=0 Jo 

where is defined as g with instead of C, it follows that 

= £ P m J* F-\u)2-gV\u,Valz + al,m)du, (4.13) 

and 

£ ^r(*) = E A. / + ™) *x. (4.14) 

Tn=0 ^ 

Now (4.12), along with (4.8), (4.9), (4.13) and (4.14), constitutes a diffusion 
model for the spot market price P u as / — ► co. From the results derived in Sec- 
tions 4.5. and 4.6., we can approximate (4.12) by dP t = 0 when z < 7, and 



dP t = £ A. frC (-IP, + + ±Af>) dt 

+ VrC Qp< - Af 0 ) dW^ , (4.15) 



when - = 7. V is a constant related to the accuracy to which we want to evaluate 
g, and can typically can be taken to be 1 (see Section 4,6.). Vm, the constants 
A"}" 1 *. Ao" 1 *, which depend only on the bid price distribution F. are defined (and 
can Uv computed) by (4.23) and (4.25) respectively. 

Remark; z = 7 corresponds to A* f = C, i.e., the system is full. Thus we have 
a model where the spot price remains constant when the system is below capacity, 
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and varies when the system is full. This corresponds to what one would intuitively 
expect from the auction mechanism (with the constant price 0 when the system is 
not full). Note that we do not attempt to model the transitions between the two 
regimes, so this model is meaningful only during periods when the system is almost 
always full. 

4,3 ♦ Computing Reservation Fees: the Derivative Market 

Taking the current market price P t as initial condition, the solution of (4,15) provides 
a stochastic model of the future prices {P r } T>t (this solution is unique in distribution 
- see [45]). From that, we now determine the price of a reservation, arriving at time 
with a holding time T, and a bid price p. The value, or "fair price", at time t of 
au option to buy a security for a "strike price" p at a specific future date 3 r > f is 



In our context, this concept must be extended in the following straightforward 
manner: we define the reservation as a hold option, a new kind of derivative instru- 
ment which is an option to buy repeatedly at every time instant from t to t + T. 
Thus the reservation fee should be 



If A'j = A' 2 = 0, ( r is given explicitly by the Black-Scholes formula [36]. For the 



(see [45, 36]) 



C. t E(P T - p)+. 




(4.16) 



3 This is called a European option, as opposed to an American option, which is the right to buy 
at any time between t and r. 
Note that we take the risk-free interest rate to be 0. 
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more general form (4.15), which we re-write as 

dP T = {AP r + K 3 ) dt + {DP T + K 4 ) dW t , (4.17) 
for r > t, with known initial condition Pt, the solution is (see [45], Section 5.6) 

P T = S r [P t + [ du + f t f± , (4.18) 



where 



5 T £ exp [(A - Z? 2 /2) (r - t) + jT £ dW,] 



Knowing the set of distributions 7^ = {P(P T < x|P t ), Vx : r > t} ? we can evalu- 
ate (V, for r > i and the reservation fee <j> follows. 

4.4. Simulations 

In this section, we simulate the pricing mechanisms presented in the preceding 

sections, applied to dial-up Internet access. 

Our data set consists of all the connections that were made to the Columbia 

University modem pool between April 4th and May 10th 1998. At that time, the 

sysiem, with a total of 316 lines, was very heavily loaded (except for the period 
roughly from 2AM to 7AM every day, essentially all lines are always busy). There 
was a total 669,994 calls, from 9451 different users, with an average inter-arrival time 
of 5.59 seconds, and an average call duration of 1299.3 seconds (see Figure 4-3). 

The unsatisfied demand, i.e., the calls which encounter busy signals, are not 
logged in our data trace, since the blocking occurs in the public telephone network. 
Our data trace only includes admitted calls. Thus feeding the trace into a system of 
316 lines would result in no blocking, which would not be interesting since blocked 




seconds hours 

Figure 4-3: Modem pool call statistics (solid) and equivalent - same mean - expo- 
nential distributions (dashed) 

calls are what gives rise to the market price. By scaling down the capacity of system 
in our simulation, we can recreate the congestion. The traffic was fed into a system 
running with 202 lines, of which 18 were allocated to the first stage, and 184 to 
the second stage. The call arrival times and bid durations are from the real traffic 
trace (however the simulation was run at lOx real-time speed), while bid prices are 
randomly generated from F. the uniform distribution on [0, 1). 

Rather than explicitly evaluating the set of distributions functions Vt resulting 
from (4.18), our technique consists of Monte-Carlo siumulation. Specifically, at each 
time when a reservation has to be computed, beginning at the known current price 
P u we generate sample paths by the difference equation corresponding to (4.15), 
which results in a set of histograms (one for each value of r) corresponding to V%. 
Figure 4-4 shows a representative example. 

Since the arrival process is clearly time- varying, the implemented reservation 
mechanism makes real-time estimat os of A. and r , to evaluate the constants A, A3, D 
and A' j. using the formulas in Section !.(>.. 

«-7 



109 




0.4 



0.2 



02. 



0.1 



12 



10 



12 



Figure 4-4: Histograms of P r , for Po = 0 and r = 0, 1, 2 minutes; the bins represent 
price levels [0, 0.1), [0.1, 0.2), . . . , [0.9, 1). 
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Figure 4-5: Simulation trace 
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Figure 4-5 shows a snapshot of the simulation trace. The first plot shows the 
number of users arriving in each batch interval where ~ = 1 minute. The second 
plot shows the spot price process resulting from the batch auctions. As expected, 
prices are zero when the system is lightly loaded during the night. The average price 
including days and nights is 0.2055. 

The third and fourth plots show the number of users in the second stage, in the 
reserved and unreserved states. In this simulation, all users accept the reservation 
offer, thus they only remain in the unreserved state for a brief period, from the time 
the offer is sent by the auctioneer (which is a server program), until the acceptance 
returns from the user (which is a client program on a different host). 

In the fifth plot, "value" is the sum of the bid prices in the second stage (re- 
served+unreserved): this represents the social welfare, or the total value that the 
users are getting from the system. Without pricing, i.e., if all the users were sim- 
ply admitted on a FCFS basis, the average value during peak periods would be 
184 x 0.5 = 92, since the average bid price is 0.5. Here the average value during the 
day is approximately 110, i.e., the pricing mechanism yields a 20% gain in efficiency. 
In other words, without pricing there are many times when a user who values access 
less is denying a user who values it more. The efficiency is shown more clearly in 
Figure 4-6. normalized so that the horizontal line corresponds to the FCFS system. 
Note that there is an efficiency gain of up to 10% even during off-peak hours, because 
some of the off-peak users arrived during peak hours. 

The sixth plot of Figure 4-5 shows the cumulative revenue from usage charges 
and reservation fees. Since the batch interval is one minute, and usage charges are 
assessed on a minutely basis, we take the price of 1 as the "full price" reference level, 
i.e., onr unit of revenue is when 1 line is charged a price of 1 for I minute. The 
unit is called an MOU (minute-of-use). As can be seen on the plot, the revenue for 

U1 
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Figure 4-6: Efficiency: 1.0 = first-come first-served system 

one day is approximately 125,000 MOUs. The 184 lines multiplied by 1440 minutes 
in a day yield about 264,960 potential minutes of use. Thus, on average, a modem 
line generates 0.47 MOUs of revenue per minute, including usage and reservation 
charges. 

Figures 4-7 and 4-8 show how the reservation fee offsets any attempt to "arbi- 
trage". Durations range from 0 to 24 hours, and the reservation fees range from 0 
to 640 MOUs, with the highest ratio being 1 MOU of reservation fee per minute of 
reservation duration, as can be seen by the straight line upper-bounding the scatter 
plot in Figure 4-7. The highest ratio (most expensive reservation) would be for a 
user who gets into the system with a bid price near zero and requests a long reser- 
vation at a time when the arrival rate is high. That is illustrated by Figure 4-8. 
A user asking for a long duration reservation at a low bid price must pay a higher 
fee which compensates for the expected future rise in prices. The vertical variations 
are due to the market price at the moment the reservation offer is made, where the 
higher end are reservations made when the market price is high. 

no 
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Figure 4-7: The reservation fee is at most Ix duration. 




Figure 4-S: Reservations are proportionally more expensive for low bid prices. 



113 



4.5. Approximations related to g 

We first seek a convenient approximation of (4.11). Fix V > 0. 

4.5.1 Case rnu(l - u) > V 2 

By the Central Limit Theorem, the binomial density tends to the normal density, 
i.e., for 0 < u < 1, we have 

k\(^ky U [1 ~ U) ~j2rmu(l-u) 2«(1 - u)m )' 



Thus, 



g(u,n,m) = 



ml 



ra - c + n - 1 (l-u) c_n 



(C7-n)!(ro-l-C + n)! 

m-C + n 1 „._f-{^- 



u ij2irmu(l — u) 



1 c::p f -[g-"-(i-w ) 

u(l-u) I 2u(l-u)m J 



Differentiation yields 



(tj,n ? m) « <?(u,n,m) 



1 C — 7i — m(l — u) 



m-C + n u(l — u)m 



and 



d 2 S, ^ e \ 

— (u,n,m) « g(u,n,rn) 



1 C — n — m(l — 

+ 



m — (C — n) ti(l — w)m 

Vm — C + nJ u(l — u)rn] 



Now, recalling that (7^ = /a + 7\/la, we get 



^^(u, \7o7r + a/, m) 



m — (7 — :)Vn/ 
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yJ2irmu(l — u) 



exp < 



" [(7 - z)Val - (1 - u)m] J 



2-u(l — u)m 



When 2 7^ 7, as / — ► co the exponential goes to zero faster than the other factors 
grow, thus fjj(u, + a/,ro),0(u, v^k + or/,m) -> 0. Therefore, by (4.13) 
and (4.14), ^(z) = (z) - 0. 
But when z = 7, 

*fl>(ti,VSfc + a/,m) - ! expi ~ (1 ~" )m | (4.19) 

yfal ( 1 _ I) VS7z + a/, m) (4.20) 
az \TTl u/ 



u, Valz + a/, m) ^ a/ (— — — ~ / v 

\m u) m z uil — u) 



[\m u/ m z u(l — u)m 
g^lu^Valz + al^ m) 



(4.21) 



as / — ► oo, where ~» indicates that it is an approximation valid for large m. 
4.5.2 Case mu(l - ti) < V 2 

Here we consider the neighborhoods of u = 1 and « = 0, where the normal approx- 
imation is no longer valid. Indeed 

g(lj n ? m) = ml^}(n), 



and 



where ls(x) = 1 if x € 5 and = 0, otherwise. 
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Thus, for u near 1, we write 



(/^(if, Vaiz + a/ : m) « ml/ 1 1 ] (z) -* ml^y(z) 



— g^ l \u, >/a/2 + a/, m) w ro 



32 



2 ^W(«, va/z 4- a/, to) « m 



o, 



where 5 is the Dirac-£ function. 

For u near 0, we get the same approximation with 7 replaced by 7 — ^gi, which 
becomes identical to the above as / — » co. 

HereVz, = £/*>(*) -> 0. 



4.6. Approximating the Drift and Diffusion Coefficients 

When z ^ 7, by the results of 4.5.1 and 4.5.2, the right-hand side of both (4.13) 
and (4.14) — > 0, as / — ► 00. Therefore, by the finiteness of (4.8) and (4.9), the 
right-hand side of (4.12) — > 0, as / — y 00. 

Thus we need only be concerned with the regime 2 = 7. Let 

A l-y/l-4V*fm 

U V,m = 2 5 

which satisfies av t7n (l — wv jm )m = F 2 . V can be interpreted as the smallest variance 
at which the binomial distribution is sufficiently close to the normal distribution. 
Typically V can be taken to be about 1. 

Now we will approximate g and its partials in three regions: [0,uv )7n )* [ur.mi 1 — 
uv.m], and (1 - ttv. m -l]- 

In the first find last regions, as shown in Section 4.5.2, the coefficients tend 
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to 0, Thus we need only focus on the middle region, where the appoximations of 
Section 4.5.1 apply. Also, if m < 4 V 2 , uv> is not real, and in fact we are always in 
the case of Section 4.5.2, where the coefficients tend to zero. Thus we have non-zero 
terms only when m > 4 V 2 . 

Writing a® = -r(7 + v7a) = -rC/Vte, and plugging (4.20) into (4.13), we get 



a (0 « - / W ' m F~ l (u)rC (---) 9 {l) {u, Vriz + a/,m) du, 

_ -l£p(0 + r cjsr< m >, (4.22) 



as I — * oo, where 

. /f"' f-. ( _^_)!!f±=A^d» (4.23) 

where the second line follows from a change of variables w = y^(l — u)mfu, 



and 



Similarly, using <r (/)2 = r(l + i/Vla) = rC/(/a), and plugging (4.21) into (4.14) 
leads to 

-— -Cr ( '' 2 « / F \v)tC — + — - — — X 

dz 2 y UViJn \Bu u 2 u{l — u)B) 

ml f —(1 — «)m] , 
exp < \ du, 

u yj2xmu(l - a) I 2u J 
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rC(-2Ki m) /B + Ki m) ) 



(4.24) 



as / — ► oo, where 



Ju Vtm V \ (1 - u)B) 

-(1 - u)m 



u y2xrnw(l — u) 
(») / 



2u 



id 2 + m V 1 




Finally, 



dz 



-e-^^dw. (4.25) 



« £ UV ' m F-\u)yfiC (jj - g (,) (u, Valz + a/, B) du, 



Substituting all of the above into (4.12) yields (4.15). 
4.6.1 Example: F uniform 

Suppose F is the uniform distribution on [0, P}. Then F~ 1 (-^^) - P^?, hence 



A? 



(») 



= IP [*(w\ m) ) - $(4 m) )] , 



where $ is the normal distribution of mean zero, and variance 1. Also, 



(m) fif 



m 



(to) 



-,,< m > 2 /2 f 2£i 



(m) 



m 



(to) 



+ 2A'J 



\1L 
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4.7. Optimal Seller Strategies 

In this section, by way of conclusion, we briefly outline approaches to the comple- 
mentary control problem, i.e., how the seller can "tune" the mechanism in order to 
optimize some system- wide objective. 

Taking the above mechanism as a given, consider the optimal strategy for the 
network manager, whose objective is to maximize revenue, using /i(P) 6 [0,oo] 
as the (feedback) control. As the spot market is governed by auctions, P = 0 
whenever the system is not full. The relevance of the problem is during periods 
when the system is heavily loaded (i.e., almost always full). Then, the objective is 
equivalent to minimizing 



over all admissible controls fi(t) = fi[P]. r is the stopping time r = inf {i > s : P t = 
0}, and the initial state is P s = y. The dynamic programming equation is 



If W is a solution of the above equation, then the optimal cost is J(s,y,/i*) = 
W(s,y), and, in the classical approaches, the optimal control is obtained by per- 
forming the minimization over fi. In this case, even when W can be solved for, 
or guessed, for each y finding the control fi(y) is in general not tractable since it 
involves polynomials of degree B\. 

There are a number of promising approaches to this control problem. It is inter- 




0= — + m\n{AW -y), 
os u> 



for y > 0, and W(s,Q) = 0* The differential operator A is given by [20] 
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esting to investigate the applicability of approximation techniques by discretizing 
the original problem, and solving as a Markov-decision problem, in the manner 
of [53] as well as that of [81], and compare the resulting performance with the opti- 
mal open-loop control. One should also consider some heuristic solutions based on 
related but simpler solvable problems, and investigate the relationship of the opti- 
mal control with the notion of Brovmian local time, as in [11], Finally, the trade-offs 
between optimality and computational complexity should be investigated. 
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